Kernel imputation with multivariate auxiliaries

نویسنده

  • Nicklas Pettersson
چکیده

We consider a data set with missing observations but known auxiliaries for the sample and develop a real donor imputation. For each unit with missing observations we construct a distribution over a set of possible donors. We want the expectation (or distribution) to be chosen so that the expectation (or distribution) of the imputed values should equal the distribution of the units’ true values. This is obtained by letting the expected values of the auxiliaries equal the true value. Several kernel estimation features are introduced to reduce the bias associated with the unbalanced donor sets, due to sparse and bounded data sets. To get the good properties of kernel estimates to carry over, multiple imputation is used. Simulation studies indicate that our method has a good performance compared to competing methods. This is particularly noticable in notoriously difficult situations e.g. when the relationship between the study and auxiliary variables is nonlinear. The displayed simulations are based on two auxiliary variables, but the algorithm is generally formulated for any number of auxiliaries.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Missing data imputation in multivariable time series data

Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...

متن کامل

Armin Iske * Scattered Data Approximation by Positive Definite Kernel Functions

Kernel functions are suitable tools for scattered data interpolation and approximation. We first review basic features of kernel-based multivariate interpolation, before we turn to the construction and the characterization of positive definite kernels and their associated reproducing kernel Hilbert spaces. The optimality of the resulting kernel-based interpolation scheme is shown. Moreover, we ...

متن کامل

Accuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)

Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...

متن کامل

Kernel Ridge Estimator for the Partially Linear Model under Right-Censored Data

Objective: This paper aims to introduce a modified kernel-type ridge estimator for partially linear models under randomly-right censored data. Such models include two main issues that need to be solved: multi-collinearity and censorship. To address these issues, we improved the kernel estimator based on synthetic data transformation and kNN imputation techniques. The key idea of this paper is t...

متن کامل

An Unsupervised Multivariate Time Series Kernel Approach for Identifying Patients with Surgical Site Infection from Blood Samples

A large fraction of the electronic health records consists of clinical measurements collected over time, such as blood tests, which provide important information about the health status of a patient. These sequences of clinical measurements are naturally represented as time series, characterized by multiple variables and the presence of missing data, which complicate analysis. In this work, we ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013